Knowledge Transfer with Medical Language Embeddings

نویسندگان

  • Stephanie L. Hyland
  • Theofanis Karaletsos
  • Gunnar Rätsch
چکیده

Identifying relationships between concepts is a key aspect of scientific knowledge synthesis. Finding these links often requires a researcher to laboriously search through scientific papers and databases, as the size of these resources grows ever larger. In this paper we describe how distributional semantics can be used to unify structured knowledge graphs with unstructured text to predict new relationships between medical concepts, using a probabilistic generative model. Our approach is also designed to ameliorate data sparsity and scarcity issues in the medical domain, which make language modelling more challenging. Specifically, we integrate the medical relational database (SemMedDB) with text from electronic health records (EHRs) to perform knowledge graph completion. We further demonstrate the ability of our model to predict relationships between tokens not appearing in the relational database.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Embedding Open-domain Common-sense Knowledge from Text

Our ability to understand language often relies on common-sense knowledge - background information the speaker can assume is known by the reader. Similarly, our comprehension of the language used in complex domains relies on access to domain-specific knowledge. Capturing common-sense and domain-specific knowledge can be achieved by taking advantage of recent advances in open information extract...

متن کامل

Constraining Word Embeddings by Prior Knowledge - Application to Medical Information Retrieval

Word embedding has been used in many NLP tasks and showed some capability to capture semantic features. It has also been used in several recent studies in IR. However, word embeddings trained in unsupervised manner may fail to capture some of the semantic relations in a specific area (e.g. healthcare). In this paper, we leverage the existing knowledge (word relations) in the medical domain to c...

متن کامل

Bilingual emb e ddings with random walks over multilingual wordnets

Bilingual word embeddings represent words of two languages in the same space, and allow to transfer knowledge from one language to the other without machine translation. The main approach is to train monolingual embeddings first and then map them using bilingual dictionaries. In this work, we present a novel method to learn bilingual embeddings based on multilingual knowledge bases (KB) such as...

متن کامل

Biomedical Word Sense Disambiguation with Neural Word and Concept Embeddings

OF THESIS Biomedical Word Sense Disambiguation with Neural Word and Concept Embeddings Addressing ambiguity issues is an important step in natural language processing (NLP) pipelines designed for information extraction and knowledge discovery. This problem is also common in biomedicine where NLP applications have become indispensable to exploit latent information from biomedical literature and ...

متن کامل

The Effect of Knowledge of Result Feedback Timing on Speech Motor Learning in Healthy Adults

Objectives: The current study mainly aimed at studying the effect of Knowledge of Result (KR) feedback timing and result-estimation opportunity before receiving delayed KR on learning a new speech motor skill in monolingual healthy adults.  Methods: Thirty-nine Persian healthy adults were randomly divided into three groups. Each group received immediate KR, delayed KR (after eight seconds), or...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1602.03551  شماره 

صفحات  -

تاریخ انتشار 2016